Proactive Thermal-Aware Multicore Scheduling Using Job Migration and Power-Gating
نویسندگان
چکیده
Overheating multi-core chips is known to accelerate their failure. To improve the reliability of advanced embedded real-time systems built with multi-core chips, dynamic thermal management (DTM) has been developed and applied to mission/safety-critical applications running on a multi-core chip to avoid possible thermal hazards, and hence to avoid/reduce chip failure. Traditional DTM based on dynamic voltage and frequency scaling (DVFS) has limited applicability to modern multi-core chips due to increasing leakage power. Moreover, DTM must be applied in such a way that the underlying applications meet their timing constraints. In this paper, we explore a new class of DTM, namely, jobmigration and power-gating, and propose an efficient runtime thermal-aware scheduler (TAS) to avoid thermal hazards without violating critical applications’ timing constraints. Based on proactive thermal management using Support Vector Machines (SVMs), the TAS periodically estimates future core temperature and triggers a pre-defined thermal management scheme that migrates the jobs running on the estimated-to-be-overheated cores to other cooler cores. The overheated cores can then be turned off for a certain period of time to cool down, when necessary. With the schedulability condition checked by the TAS, all tasks’ timing constraints are guaranteed to be met even when jobs are migrated from overheated cores and some cores are powered off and on. Our in-depth evaluation based on the HotSpot thermal simulator has shown that the proposed algorithm effectively maintains the core temperature below a given threshold without violating any timing constraint.
منابع مشابه
Proposed Feature Selection for Dynamic Thermal Management in Multicore Systems
Increasing the number of cores in order to the demand of more computing power has led to increasing the processor temperature of a multi-core system. One of the main approaches for reducing temperature is the dynamic thermal management techniques. These methods divided into two classes, reactive and proactive. Proactive methods manage the processor temperature, by forecasting the temperature be...
متن کاملEcoMobile: Energy-aware Real-time Framework for Multicore Mobile Systems
In this paper, we propose a design and development plan to build an energy-aware framework for multicore mobile systems which extends Android with two features it lacks: fine-grain power management functionality and real-time support. We also provide three novel energy-aware real-time multicore scheduling algorithms to reduce both dynamic and leakage power consumptions. These algorithms address...
متن کاملHeVac: A Heterogeneous Vacation Scheme for Thermal-Aware Multicore Packet Processing
As processor power density increases, thermal and power control becomes critical for packet processing. Network applications feature ON/OFF execution pattern, which causes frequent temperature and power consumption changes in the processor. In this paper, we propose a novel power aware thermal management algorithm, which achieves power saving in multicore processors by employing a vacation sche...
متن کاملA Fast Scheme to Investigate Thermal-Aware Scheduling Policy for Multicore Processors
With more cores integrated into one single chip, the overall power consumption from the multiple concurrent running programs increases dramatically in a CMP processor which causes the thermal problem becomes more and more severer than the traditional superscalar processor. To leverage the thermal problem of a multicore processor, two kinds of orthogonal technique can be exploited. One is the co...
متن کاملTopology-Aware Scheduling on Blue Waters with Proactive Queue Scanning and Migration-Based Job Placement
Modern HPC systems, such as Blue Waters, has a multidimensional torus topology, which makes it hard to achieve both high system utilization and scheduling speed. The low scheduling speed comes from the inefficient resource allocation scheme by using a pre-defined Shape Table, which is highly time consuming. The low system utilization is majorly caused by system fragmentation and node drainage. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011